BIOTECHNOLOGY 
SYSTEMS 
BRANCH 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 
Source: 

Date Processed by STIC: 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS; PLEASE USE- THE CHECKER 
VERSION 4.2 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker/chkrnote.htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment- given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 2023 1 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm> , EFS Submission 
User Manual - ePAVE) 

"2. U.S. Postal Service: Commissioner for Patents; l P~0: Box 1450, Alexandria, VA 22313-1450 r- 

3. Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 06/05/04): 

U.S. Patent and Trademark Office, 220 20 th Street S., Customer Window, Mail Stop Sequence, Crystal Plaza Two, Lobby, 
Room 1B03, Arlington, VA 22202 



Revised 05/17/04 



Raw Sequence Listing Error Summary 



ERROR DETECTED 

ATTN: NEW RULES CASES: 

I Wrapped Nucleics 

Wrapped Aminos 



to /S) ' /.tf/S 



SUGGESTED CORRECTION SERIAL NUMBER: 

: PLEASE DISREGARD ENGLISH -ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it Please adjust your right margin to .3; this will 
prevent "wrapping." 



Jnvalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 



Misaligned Amine 
Numbering 

Non-ASCII 



Variable Length 



Patentln 2.0 
" "bug" 



7 Skipped Sequences 

(OLD RULES) 



The numbering under each 5* amino acid is misaligned. Do not use.tab codes between numbers; 
use space characters, instead, -.r - . . 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission Is saved in ASCII text 

Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused^the <220>-<223> section to be missing from amino acid 

sequences(s) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Sequence(s) . missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Donbt insert any subheadings under this heading) 
(xi) SEQUENCE DESCR1PTI0N:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 




11 



Skipped Sequences 
(NEW RULES) 



Use of n's or Xaa's 
(NEW RULES) 



Jnvalid <213> 
Response 

Use of<220> 



Sequence(s) _ 



missing. If intentional, please insert the following lines for each skipped sequence. 



<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <2I3> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence - 



Sequence(s) _ 



missing the <220> "Feature" and associated numeric identifiers and responses. 



12 



Patentln 2.0 
""bug" ^ 



Use of <220> to <223> is MANDATORY if <2 1 3> "Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> section. 
(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



1 3 Misuse of n/Xaa V can only represent a single nucleotide ; "Xaa" can only represent a single amino acid 



AMC - Biotechnology Systems Branch - 09/09/2003 



PCT 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/511 , 415 



DATE: 10/25/2004 
TIME: 16:56:26 



Input Set : A:\PTO.FG.txt 

Output Set: N:\CRF4\10252004\J511415.raw 



if 



If 



3 <110> APPLICANT: Imperial College Innovations Limited 

5 <120> TITLE OF INVENTION: Methods 

7 <130> FILE REFERENCE: ICOY/P28304PC 
C--> 9 <140> CURRENT APPLICATION NUMBER: US/10/511, 415 
C--> 10 <141> CURRENT FILING DATE: 2004-10-14 

12 <150> PRIOR APPLICATION NUMBER: PCT/GB03/01625 

13 <151> PRIOR FILING DATE: 2003-04-15 
15 <160> NUMBER OF SEQ ID NOS: 3 0 
17 <170> SOFTWARE: Patent In version 3.1 

19 <210> SEQ ID NO: 1 

20 <211> LENGTH: ' 400 

21 <212> TYPE: PRT 

22 <213> ORGANISM: Homo sapiens 
24 <400> SEQUENCE: 1- 

26 Met Met Asp Leu Arg Asn Thr Pro Ala Lys Ser Leu Asp Lys Phe lie 

27 1 5 10 15 

3 0 Glu Asp Tyr Leu Leu Pro Asp Thr Cys Phe Arg Met Gin lie Asp His 
31 20 25 30 

34 Ala lie Asp lie lie Cys Gly Phe Leu Lys Glu Arg Cys Phe Arg Gly 

35 35 40 45 

38 Ser Ser Tyr Pro Val Cys Val Ser Lys Val Val Lys Gly Gly Ser Ser 

39 50 55 60 

42 Gly Lys Gly Thr Thr Leu Arg Gly Arg Ser Asp Ala Asp Leu Val Val 

43 65 70 75 80 

46 Phe Leu Ser Pro Leu Thr Thr Phe Gin Asp Gin Leu Asn Arg Arg Gly 

47 85 90 95 

50 Glu Phe lie Gin Glu lie Arg Arg Gin Leu Glu Ala Cys Gin Arg Glu 

51 100 105 110 

54 Arg Ala Leu Ser Val Lys Phe Glu Val Gin Ala Pro Arg Trp Gly Asn 

55 115 120 125 

58 Pro Arg Ala Leu Ser Phe Val Leu Ser Ser Leu Gin Leu Gly Glu Gly 

59 130 135 140 

62 Val Glu Phe Asp Val Leu Pro Ala Phe Asp Ala Leu Gly Gin Leu Thr 

63 145 150 155 160 

66 Gly Ser Tyr Lys Pro Asn Pro Gin lie Tyr Val Lys Leu lie Glu Glu 

67 165 170 175 

70 Cys Thr Asp Leu Gin Lys Glu Gly Glu Phe Ser Thr Cys Phe Thr Glu 

71 , 180 185 190 

74 Leu Gin Arg Asp Phe Leu Lys Gin Arg Pro Thr Lys Leu Lys Ser Leu 

75 195 200 205 

78 lie Arg Leu Val Lys His Trp Tyr Gin Asn Cys Lys Lys Lys Leu Gly 

79 210 215 220 

82 Lys Leu Pro Pro Gin Tyr Ala Leu Glu Leu Leu Thr Val Tyr Ala Trp 



Dose Not Comply 
Gonteted Diskette Needec 
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RAW SEQUENCE LISTING DATE: 10/25/2004 

PATENT APPLICATION: US/10/511,415 TIME: 16:56:26 



Input Set : A:\PTO.FG.txt 

Output Set: N:\CRF4\10252004\J511415.raw 

83 225 230 235 240 

86 Glu Arg Gly Ser Met Lys Thr His Phe Asn Thr Ala Gin Gly Phe Arg 

87 245 250 255 

90 Thr Val Leu Glu Leu Val lie Asn Tyr Gin Gin Leu Cys lie Tyr Trp 

91 260 265 270 

94 Thr Lys Tyr Tyr Asp Phe Lys Asn Pro lie lie Glu Lys Tyr Leu Arg 

95 275 280 285 

98 Arg Gin Leu Thr Lys Pro Arg Pro Val lie Leu Asp Pro Ala Asp Pro 



99 


290 








295 


300 








102 


Thr Gly 


Asn 


Leu 


Gly 


Gly Gly Asp 


Pro Lys Gly 


Trp 


Arg Gin 


Leu 


Ala 


103 


305 








310 


315 








320 


106 


Gin Glu 


Ala 


Glu 


Ala 


Trp Leu Asn 


Tyr Pro Cys 


Phe 


Lys Asn 


Trp Asp 


107 








325 




330 






335 




110 


Gly Ser 


Pro 


Val 


Ser 


Ser Trp lie 


Leu Leu Ala 


Glu 


Ser Asn 


Ser 


Thr 


111 






340 






345 




350 






114 


Asp Asp 


Glu 


Thr 


Asp 


Asp Pro Arg 


Thr Tyr Gin 


Lys 


Tyr Gly 


Tyr 


lie 


115 




355 






360 






365 






118 


Gly Thr 


His 


Glu 


Tyr 


Pro His Phe 


Ser His Arg 


Pro 


Ser Thr 


Leu 


Gin 


119 


370 








375 




380 








122 


Ala Ala 


Ser 


Thr 


Pro 


Gin Ala Glu Glu Asp Trp 


Thr 


Cys Thr 


He 


Leu 


123 


385 








390 


395 








400 



126 <210> SEQ ID NO: 2 

127 <211> LENGTH: 1590 
12 8 <212> TYPE: DNA 

12 9 <213> ORGANISM: Homo sapiens 

131 <400> SEQUENCE: 2 

132 gaggcagttc tgttgccact ctctctcctg tcaatgatgg atctcagaaa 
134 aaatctctgg acaagttcat tgaagactat ctcttgccag acacgtgttt 
136 atcgaccatg ccattgacat catctgtggg ttcctgaagg aaaggtgctt 
138 tcctaccctg tgtgtgtgtc caaggtggta aagggtggct cctcaggcaa 
140 ctcagaggcc gatctgacgc tgacctggtt gtcttcctca gtcctctcac 
142 gatcagttaa atcgccgggg agagttcatc caggaaatta ggagacagct 
144 caaagagaga gagcactttc cgtgaagttt gaggtccagg ctccacgctg 
146 cgtgcgctca gcttcgtact gagttcgctc cagctcgggg agggggtgga 
148 ctgcctgcct ttgatgccct gggtcagttg actggcagct ataaacctaa 
150 tatgtcaagc tcatcgagga gtgcaccgac ctgcagaaag agggcgagtt 
152 ttcacagaac tacagagaga cttcctgaag cagcgcccca ccaagctcaa 
154 cgcctagtca agcactggta ccaaaattgt aagaagaagc ttgggaagct 
156 tatgccctgg agctcctgac ggtctatgct tgggagcgag ggagcatgaa 
158 aacacagccc aaggatttcg gacggtcttg gaattagtca taaactacca 
160 atctactgga caaagtatta tgactttaaa aaccccatta ttgaaaagta 
162 cagctcacga aacccaggcc tgtgatcctg gacccggcgg accctacagg 
164 ggtggagacc caaagggttg gaggcagctg gcacaagagg ctgaggcctg 
166 ccatgcttta agaattggga tgggtcccca gtgagctcct ggattctgct 
168 aacagtacag acgatgagac cgacgatccc aggacgtatc agaaatatgg 
170 acacatgagt accctcattt ctctcataga cccagcacgc tccaggcagc 
172 caggcagaag aggactggac ctgcaccatc ctctgaatgc cagtgcatct 
174 ggctccagtg ttatctggac cagttccttc attttcaggt gggactcttg 
176 gacaaagctc ctcagtgagc tggtgtataa tccaagacag aacccaagtc 



taccccagcc 


60 


ccgcatgcaa 


120 


ccgaggtagc 


180 


gggcaccacc 


240 


cacttttcag 


300 


ggaagcctgt 


360 


gggcaacccc 


420 


gttcgatgtg 


480 


cccccaaatc 


540 


ctccacctgc 


600 


gagcctcatc 


660 


gccacctcag 


720 


aacacatttc 


780 


gcaactctgc 


840 


cctgagaagg 


900 


aaacttgggt 


960 


gctgaattac 


1020 


ggctgaaagc 


1080 


ttacattgga 


1140 


atccacccca 


1200 


tgggggaaag 


1260 


atccagagaa 


1320 


tcctgactcc 


1380 
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RAW SEQUENCE LISTING DATE: 10/25/2004 

PATENT APPLICATION: US/10/511,415 TIME: 16:56:26 

Input Set : A:\PTO.FG.txt 

Output Set: N:\CRF4\10252004\J511415.raw 

178 tggccttcta tgccctctat cctatcatag ataacattct ccacagcctc acttcattcc 1440 
180 acctattctc tgaaaatatt ccctgagaga gaacagagag atttagataa gagaatgaaa 1500 
182 ttccagcctt gactttcttc tgtgcacctg atgggagggt aatgtctaat gtattatcaa 1560 
184 taacaataaa aataaagcaa ataccaaaaa 1590 

187 <210> SEQ ID NO: 3 w I /I J \ 

188 <211> LENGTH: 20 dd W 9 fJLJt 

189 <212> TYPE: DNA^— ^ /^M^ ^ ' ' ^ 

190 <213> ORGANISM:<PCR primer J f OS ~j\M*-> f® U ^ 

192 <400> SEQUENCE: 3 ^ V ^ 

193 ctcactgagg agctttgtct 20 

196 <210> SEQ ID NO: 4 

197 <211> LENGTH: 18 

198 <212> TYPE: DNA 

199 <213> ORGANISM:^ PCR primer' 
201 <400> SEQUENCE: 

2 02 cactgaggag ctttgtcc 18 

205 <210> SEQ ID NO: 5 

206 <211> LENGTH: 21 
2 07 <212> TYPE: DNA. 
208 <213> ORGANISM^ PCR primer 

210 <400> SEQUENCE : 

211 caggtgggac tcttgatcca g 21 

214 <210> SEQ ID NO: 6 

215 <211> LENGTH: 20 

216 <212> TYPE: DNP 

217 <213> ORGANISM: ^CR primer^ 

219 <400> SEQUENCE: 6 

220 agggttcctg gccgtgcagg 20 

223 <210> SEQ ID NO: 7 

224 <211> LENGTH: 18 

225 <212> TYPE: DNiy 

226 <213> ORGANISM^£CR primer, 

228 <400> SEQUENCE: 

229 ccgcgctccc tcggctgc 18 

232 <210> SEQ ID NO: 8 

233 <211> LENGTH: 20 

234 <212> TYPE: DOT 

235 <213> ORGANISM^ PCR primer^ 

237 <400> SEQUENCE: 

238 atattctctt tgtaatcagg 20 

241 <210> SEQ ID NO: 9 

242 <211> LENGTH: 20 

243 <212> TYPE: DNA> 

244 <213> ORGANISM^ 

246 <400> SEQUENCE: 

247 aaaaatggca atcactcacc 20 

250 <210> SEQ ID NO: 10 

251 <211> LENGTH: 20 

252 <212> TYPE: DNA 
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RAW SEQUENCE LISTING DATE: 
PATENT APPLICATION: US/10/511, 415 TIME: 

Input Set : A:\PTO.FG.txt 

Output Set: N:\CRF4\10252004\J511415.raw 



10/25/2004 
16:56:26 



253 <213> ORGANISM < PCR prime ^ 

255 <400> SEQUENCE: 10 

256 ccttctatga tttctcctag 

259 <210> SEQ ID NO: 11 

260 <211> LENGTH: 20 

261 <212> TYPE: DNA 

262 <213> ORGANISM: (|CR primer 

264 <400> SEQUENCE: 

265 atccaaaggc aatacgtacc 

268 <210> SEQ ID NO: 12 

269 <211> LENGTH: 20 

270 <212> TYPE: DNA 

271 <213> ORGANISM: (PGR primer 

273 <400> SEQUENCE: 12 " 

274 acagtgtttt atctttaagg 

277 <210> SEQ ID NO: 13 

278 <211> LENGTH: 21 

279 <212> TYPE: DNA> 

280 <213> ORGANISM ^CR primer 

282 <400> SEQUENCE: Ij 

283 gtaacattta ctacttactc g 

286 <210> SEQ ID NO: 14 

287 <211> LENGTH: 20 

288 <212> TYPE: DNA, 

289 <213> ORGANISM CpCR primer^ 

291 <400> SEQUENCE: 14 

292 ccctgttcct tttaactagg 

295 <210> SEQ ID NO: 15 

296 <211> LENGTH: 20 

2 97 <212> TYPE: DNA 
298 <213> ORGANISM<^PCR primer^ 

300 <400> SEQUENCE: 

301 ctcaggatca taatcactgc 

304 <210> SEQ ID NO: 16 

305 <211> LENGTH: 20 

3 06 <212> TYPE: DNA 
3 07 <213> ORGANISM :(pCR primer 

309 <400> SEQUENCE: Tfr- 

310 ctgtgaattt tatacccagg 
313- <210> SEQ ID NO: 17 

314 <211> LENGTH: 21 

315 <212> TYPE: D* 

316 <213> ORGANISM \J>CR primer 

318 <400> SEQUENCE: 

319 gtattacttt ttccacttac c 

322 <210> SEQ ID NO: 18 

323 <211> LENGTH: 20 

324 <212> TYPE: DNA, 

325 <213> ORGANISM: I PCR primer 



20 



20 



20 



21 



20 



20 



20 



21 
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RAW SEQUENCE LISTING DATE: 10/25/2004 

PATENT APPLICATION: US/10/511 , 415 TIME: 16:56:26 

Input Set : A:\PTO.FG.txt 

Output Set: N:\CRF4\10252004\J511415.raw 

327 <400> SEQUENCE: 18 

328 gactctcact gtcattgcag 20 

331 <210> SEQ ID NO: 19 

332 <211> LENGTH: 20 

333 <212> TYPE: DNA^~" " 

334 <213> ORGANISMQPCR primer ) 

336 <400> SEQUENCE :T9 ^ 

337 gtgtcattgc actccagcct 20 

340 <210> SEQ ID NO: 20 

341 <211> LENGTH: 624 

342 <212> TYPE: DNA 

343 <213> ORGANISM: Homo sapiens 

345 <400> SEQUENCE: 2 0 

346 ggatccagat ggcatgtcac agtatactaa atgctcactg aatccagctg caatgcagga 60 
348 agactcccct gatgtgatca tgtgtctcac cctttcaggc tgaaagcaac agtacagacg 120 
350 atgagaccga cgatcccagg acgtatcaga aatatggtta cattggaaca catgagtacc 180 
352 ctcatttctc tcatagaccc agcacgctcc aggcagcatc caccccacag gcagaagagg 240 
354 actggacctg caccatcctc tgaatgccag tgcatcttgg gggaaagggc tccagtgtta 3 00 
356 tctggaccag ttccttcatt ttcaggtggg actcttgatc cagagaagac aaagctcctc 360 
358 agtgagctgg tgtataatcc aagacagaac ccaagtctcc tgactcctgg ccttctatgc 42 0 
360 cctctatcct atcatagata acattctcca cagcctcact tcattccacc tattctctga 480 
362 aaatattccc tgagagagaa cagagagatt tagataagag aatgaaattc cagccttgac 540 
364 tttcttctgt gcacctgatg ggagggtaat gtctaatgta ttatcaataa caataaaaat 600 
366 aaagcaaata ccatttattg ggtg 624 

369 <210> SEQ ID NO: 21 

370 <211> LENGTH: 19 

371 <212> TYPE: DNA 

372 <213> ORGANISM<( j>CR primer^ 

374 <400> SEQUENCE: 21 

375 ggcctggcct gacaactat 19 

378 <210> SEQ ID NO: 22 

379 <211> LENGTH: 19 

380 <212> TYPE: DNA 

381 <213> ORGANISM ><^P CR prime r. 

383 <400> SEQUENCE: 22 

384 catccaagcc tgcacgtat 19 

387 <210> SEQ ID NO: 23 

388 <211> LENGTH: 20 isXA^f 

389 <212> TYPE: DNA^-— >-■- ^ jO 

390 <213> ORGANiSM^rfT primer J Yh«* tvoes of errors shown exist throughout 

392 <400> SEQUENCE: theSequence listing. Please check subsequent 

393 gctttgtgtg agcaacatgg sequences for similar errors- 20 

396 <210> SEQ ID NO: 24 

397 <211> LENGTH: 20 
3 98 <212> TYPE: DNA. 
399 <213> ORGANISM :(pCR primer 

401 <400> SEQUENCE: 

402 ggctcatctg gtctctccag ^ 20 
405 <210> SEQ ID NO: 25 
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VERIFICATION SUMMARY DATE: 10/25/2004 

PATENT APPLICATION: US/10/511,415 TIME: 16:56:27 

Input Set : A:\PTO.FG.txt 

Output Set: N:\CRF4\10252004\J511415.raw 

L:9 M:270 C: Current Application Number differs, Replaced Application Number 
L:10 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
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